Combining IR and LDA Topic Modeling for Filtering Microblogs
نویسندگان
چکیده
منابع مشابه
Modeling Microblogs using Topic Models
As the popularity of micro-blogging increases, managing friends and followers and their tweets is becoming increasingly complex. In this project, we explore the usage of topic models in understanding both text and links in micro-blogs. On a data set of 21306 users, we find that LDA can find good topics that seem to capture meaningful topics of discussion in twitter. We also find that knowing wh...
متن کاملMicroblog Search Task at CLEF 2017: Query Generation using IR and LDA Topic Modeling Combination
The microblogs search task at CLEF 2017 consists of developing a system to search the most relevant microblogs for cultural query in a collection about festivals in all languages. Our general approach to get this objective is the following: we propose to generate from the initial tweet queries, provided for the task, extended queries able to get an answer-rich set of microblogs. This is achieve...
متن کاملTopic Modeling using LDA with Feedback Mechanisms
Topic models provide a way to identify the latent topics from a collection of documents. Although the identified topics often appear quite representative of the data; just as often, there are parts of the output that appear erroneous or otherwise difficult to interpret by humans. This is a limitation of topic models that can be remedied by user feedback mechanisms. In this paper, I discuss two ...
متن کاملEntities as topic labels: Improving topic interpretability and evaluability combining Entity Linking and Labeled LDA
Hurvitz, A. (2013). Late Biblical Hebrew, Khan. Khan, G. (ed.) (2013). Encyclopedia of Hebrew Language and Linguistics, Vol. 4, Leiden, Brill, 2013. Kutscher, E. Y. (1974). The Language and Linguistic Background of the Isaiah Scroll (1QIsaa), STDJ 6. Leiden, Brill. Oosting, R., Dyk, J. and Glanz, O., Valence Patterns of Motion Verbs, Semantics, Syntax and Linguistic Variation, to be published. ...
متن کاملProbabilistic Topic and Syntax Modeling with Part-of-Speech LDA
This article presents a probabilistic generative model for text based on semantic topics and syntactic classes called Part-of-Speech LDA (POSLDA). POSLDA simultaneously uncovers short-range syntactic patterns (syntax) and long-range semantic patterns (topics) that exist in document collections. This results in word distributions that are specific to both topics (sports, education, ...) and part...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Procedia Computer Science
سال: 2017
ISSN: 1877-0509
DOI: 10.1016/j.procs.2017.08.166